Abstract


The ancient geographical origins of Brahmins—a prominent ethnic group in the Indian subcontinent—have remained contro- 
versial for a long time. This study employed the AMOVA (analysis of molecular variance) test to evaluate genetic affinities of 
this group with thirty populations of Central Asia and Europe. A domestic comparison was performed with fifty non-Brahmin 
groups in India. The results showed that Brahmins had genetic affinities with several foreign populations and also shared 
their genetic heritage with several domestic non-Brahmin groups. The study identified the deep ancient origins of Brahmins 
by tracing their Y-chromosome haplogroups and genetic markers on the Y-DNA phylogenetic tree. It was confirmed that 
the progenitors of this group emerged from at least 12 different geographic regions of the world. The study concluded that 
about 83% of the Brahmins in the dataset belonged to four major haplogroups, of which two emerged from Central Asia, 
one from the Fertile Crescent, and one was of an indigenous Indian origin. 
Introduction


The Brahmins represent one of the largest ethnic groups in 
the Indian subcontinent with an estimated population of over 
56 million people (Outlook 2007). Their predecessors created 
Hinduism which is now the largest religion in India and the 
third largest in the world. The ancient geographic origins of 
Brahmins have been debated for a long time. Some early histo- 
rians believed that this ethnic group emerged from an ancient 
population known as the Aryans (Havell 1918). In this study, 
recent developments in DNA science were assessed to reveal 
that Brahmins had at least 12 ancient progenitors who, with 
one exception, originated outside the Indian subcontinent. 


Brahmins and Hinduism 


The ancient Vedic religion—also known as Vedism or 
Brahmanism—was founded by ancestors of Brahmins 
between 2500 and 1500 BCE (Cooke 2011). The Vedic
religion gradually evolved into Hinduism and became 
a fusion of various Indian cultures and traditions with 
diverse roots (Bowker 1997). In this religion, there are 
four varnas or castes—Brahmin, Kshatriya, Vaishya, and 
Shudra—that signify a person’s profession and status in 
life (Sharma 2000). The Brahmins are the highest of these 
four castes in a stratified and endogamous social hierarchy 
of followers. They were self-designated as arya (“noble”) 
in Sanskrit—the lingua franca of ancient India—and their 
abode was known as aryavarta, or the land of the Aryans 
(Parpola 2015). Those in India have been referred to as 
Indo-Aryans. 

Traditionally, Brahmins have specialized as Hindu 
priests, protectors of sacred learning, teachers, philosophers, 
and leaders (Ingold 1994; Cavalli-Sforza 2000). They have 
also served as agriculturists, warriors, traders and worked in 
other occupations (Doniger 1999). There are several unique 
Brahmin communities that have spread throughout India, 
with different customs, cuisines, and languages. 


Historical perspectives 


The earliest texts written in Sanskrit that describe basic 
tenets of the Vedic religion and shed some light on the 
past are collectively known as the Vedas (knowledge).
The oldest of these texts, the Rig Veda,
a collection of Vedic hymns, was passed down orally
for centuries and slowly written down between 1500 and
500 BCE (Flood 1996; Joshi 2016). Another text, the
Mahabharata, was composed between 400 BCE and 400 
CE (Molloy 2008). These texts are more devoted to reli- 
gious ideas than history but provide some glimpses of 
the existence of agriculture, horses, chariots, and some 
ancient wars. Besides the Vedic texts there is no reliable 
written record of ancient Indian history before Alexander 
the Great’s campaign in India in 327 BCE (Smith 1921).
Written history of India is less than 2500 years old. 


First arrivals in the Indian subcontinent 


It is generally well accepted that all Homo sapiens or mod- 
ern humans are descendants of ancestors who migrated 
out of Africa in multiple dispersals between ~37 and ~135
kya (thousand years ago) (Reyes-Centeno et al. 2015). There
is archeological evidence of H. sapiens fossils in the prehistoric
caves of Qafzeh and Skhul, in present-day Israel.
Of these, the oldest fossils have been dated to ~110 kya
with mass spectrometric techniques (McDermott et al.
1993). Stone relics made by humans dated to ~106 kya
have been discovered in Oman, Arabia (Rose et al. 2011).
Another set of fossils, the remains of a mother and child
buried together ~90 kya, were discovered at Qafzeh (Bryson
2005). Some humans traveled east toward India and
others dispersed further north into central Asia and other 
parts of the world. 

There are two conflicting models about when the 
first wave of humans arrived in the Indian subcontinent. 
Michael Petraglia and his team of archaeologists have pro- 
vided evidence that humans were present in South India 
both before and after the Mount Toba eruption that took 
place in Indonesia about 75 kya (Petraglia et al. 2007, 
2010; Oppenheimer 2011; Blinkhorn et al. 2014). Another 
study has supported a more recent occupation of India 
beginning ~50 to 60 kya (Mellars et al. 2013).
Because humans had already reached the Arabian Penin- 
sula and the Middle East more than 100 kya (McDermott 
et al. 1993; Rose et al. 2011), and India lies close to Africa 
and Arabia, the pre-Toba model appears more plausible. 

Some humans arrived in India and continued their 
migration to Sri Lanka, the Andaman Islands in the 
Bay of Bengal, East Asia, and Australasia. Recent fos- 
sil evidence of modern H. sapiens has been recovered in 
Sri Lanka, dated to ~30 kya (Kennedy et al. 1987). The
isolated extant communities of the Onge and the Great 
Andamanese in the Andaman Islands evolved from human 
migrations through India (Thangaraj et al. 2005). A study 
conducted by the National Geographic Society found that
people living in a village near Madurai in South India 
carried the same rare genetic markers as some Austral- 
ian aborigines and people living in Africa, confirming a 
genetic link between the three continents (Wells 2007). 


Indus Valley Civilization 


The mode of survival for the first arrivals in the subcontinent 
was a hunting and gathering tradition. As the population 
grew, agriculture developed and gradually replaced the ear- 
lier practices. There are indications that agriculture began in 
the Fertile Crescent, in the area of Elam around the Zagros 
mountains, in modern south-western Iran, ~9 to ~11 kya
(McElreavey and Quintana-Murci 2005; Riehl et al. 2013; 
Gangal et al. 2014; Broushaki et al. 2016; Lazaridis et al. 
2016). A group of farmers from this area migrated eastward 
to northwestern India, mixed with the local hunter-gatherers 
and laid foundations for the Indus Valley Civilization (IVC) 
(Kenoyer 1991; Coningham and Young 2015; Subramanian 
and Khan 2016; Pathak et al. 2018). The migrants spoke a 
language that may have fashioned the Dravidian family of 
languages in the Indian subcontinent (McAlpin 1981). In 
time, the people of IVC became known as the earliest
agriculturists in South Asia (Harris 1996).

Also known as the Harappan Civilization, IVC arose in
several sites which are now located in North India and Pakistan.
The civilization was originally thought to have evolved around 3750 BCE,
but research has pushed back its origins
to ~10 kya (Khandekar 2012), consistent with the apparent
arrival of agriculturists from the Fertile Crescent. With the
advent of agriculture, and North India being a fertile region,
IVC became a desirable destination for a multitude of migrations
that have occurred over time. The IVC collapsed ~3.5
kya, allegedly due to environmental degradation and the
arrival of Indo-Aryans who became the dominant inhabitants
(Cavalli-Sforza and Cavalli-Sforza 1995).


Y-DNA haplogroups 


Recent developments in DNA science have revealed that 
differences and commonalities in the genetic structure of 
people can be used effectively to determine their origins and 
movements (Underhill 2003). The human Y-chromosome is 
paternally inherited without the shuffling effect of recombi- 
nation. It passes from father to son basically unchanged, but 
occasionally a random change known as a polymorphism or 
mutation occurs. Such polymorphisms—also called mark- 
ers—serve as beacons and can be mapped. 

Because the markers are intricately linked to geography 
and populations, they serve as important indicators for trac- 
ing paternal lineages and their ancient origins (Walsh 2001). 
Thus, the Y-chromosome or Y-DNA haplogroup represents
an ancestral clan whose genetic markers define the origins 
and migratory paths of its earliest ancestors, and it can be 
used to trace the paternal line of the individual going back 
in time (Jobling and Tyler-Smith 2003). 


Y-DNA phylogenetic tree 


There are twenty top level Y-DNA haplogroups in the stand- 
ardized global phylogeny (Karafet et al. 2008). They are 
identified by letters A through T, and their subclades are 
expressed as letters and numbers (e.g., G2, R1b1, E3b1b). 

The Y-DNA haplogroups contain many branches called 
subhaplogroups or subclades and form a phylogenetic tree 
of mankind (Y Chromosome Consortium 2002). The branch 
lengths of the tree are governed by the mutation rates. The 
markers on the phylogenetic tree provide pieces of evidence 
regarding the date and geographic origin of ancestors in the 
distant past. 

This study has relied on the top-level Y-DNA haplogroup 
as the primary gauge for identifying the deep geographic 
origins of Brahmins. The haplogroups were employed in 
conjunction with published literature and the Y-DNA Haplogroup
Tree maintained by the International Society of
Genetic Genealogy (ISOGG 2016) to trace the ancient geographic
origins of Brahmins.


Materials and methods 
Datasets 


A dataset of 1201 Y-chromosome profiles of Brahmins from 
diverse parts of India was compiled from databases and pub- 
lished sources. This included the Genographic Project data- 
base (Behar et al. 2007; Genographic 2016), with permission 
of the National Geographic Society, and seven published 
sources (Sengupta et al. 2006; Sharma et al. 2009; Zhao 
et al. 2009; Yadav et al. 2010, 2011; Ghosh et al. 2011; Parvathy
et al. 2011). A second dataset of 2196 non-Brahmin profiles, taken from
a previous study (Mahal and Matsoukas 2018), was used to compare
Brahmins with non-Brahmins in the Indian subcontinent in
the same haplogroups.

The haplogroups of 840 or about 70% of the profiles in 
the Brahmins dataset were predetermined at source with 
actual Y-DNA samples of men. The remaining 361 or about 
30% of profiles contained haplotypes with 12 Y-DNA
loci (DYS19, DYS385a, DYS385b, DYS389I, DYS389II,
DYS390, DYS391, DYS392, DYS393, DYS437, DYS438,
and DYS439) on the Y-chromosome. The allele frequencies
for these 361 haplotypes were employed to identify
the predominant top-level haplogroups with Whit Athey’s
Haplogroup Predictor software (Athey 2006). Because the
primary focus was on the first ancestor, the subclades were
not used. All data were merged and sorted according to
some key Brahmin communities that could be identified.


Haplogroup prediction software 


Several software programs are available that use math- 
ematical calculations to predict haplogroups from Y-DNA 
profiles. The proprietary Haplogroup Classifier, a software
tool developed at the University of Arizona, showed that
Y-chromosome haplogroups could be predicted with a high degree
of accuracy using algorithms and data from Y-linked STRs
(Schlecht et al. 2008).

Another proprietary software tool, yHaplo, was
developed at the DNA testing company 23andMe to enable
researchers to identify Y-DNA haplogroups of men.
This software has been run on more than 600,000 males
in the 23andMe database to confirm haplogroup calls for
several hundred men (Poznik 2016).

Some other haplogroup prediction tools are available 
online in the public domain. In this study, Whit Athey’s 
Haplogroup Predictor software was used (Athey 2006). 
This tool was selected because it was one of the first such
programs and served as a model for others. It also provided
a relatively easy method of data entry with batch files.
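
The general idea behind allele-frequency-based prediction can be sketched as follows. This is a minimal illustration under stated assumptions, not Athey's actual algorithm or database: the two loci are among those listed in the Datasets section, but every frequency value, the `FLOOR` constant, and the function name `predict_haplogroup` are hypothetical and serve only to make the arithmetic visible.

```python
from math import prod

# HYPOTHETICAL allele frequencies P(repeat count | haplogroup).
# These numbers are invented for illustration and are not real reference data.
ALLELE_FREQ = {
    "R": {"DYS390": {23: 0.1, 24: 0.6, 25: 0.3}, "DYS391": {10: 0.3, 11: 0.7}},
    "J": {"DYS390": {23: 0.7, 24: 0.2, 25: 0.1}, "DYS391": {10: 0.8, 11: 0.2}},
}
FLOOR = 0.001  # assumed small probability for alleles unseen in a haplogroup


def predict_haplogroup(haplotype, priors=None):
    """Return posterior probabilities per haplogroup for one Y-STR haplotype.

    Loci are treated as independent (a naive-Bayes simplification).
    """
    groups = list(ALLELE_FREQ)
    priors = priors or {g: 1 / len(groups) for g in groups}
    scores = {}
    for g in groups:
        likelihood = prod(
            ALLELE_FREQ[g][locus].get(allele, FLOOR)
            for locus, allele in haplotype.items()
        )
        scores[g] = priors[g] * likelihood
    total = sum(scores.values())
    return {g: s / total for g, s in scores.items()}


posterior = predict_haplogroup({"DYS390": 24, "DYS391": 11})
best = max(posterior, key=posterior.get)
print(best, round(posterior[best], 3))
```

With real reference frequencies over all 12 loci, the same scoring would be applied to each of the 361 haplotypes in turn; small reference samples for a haplogroup make its frequency estimates, and therefore its scores, less reliable.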


Statistical validity of dataset 


In statistical analyses, as the population increases in size,
the required sample size increases at a diminishing rate and remains
relatively constant once it reaches about 380.
A sample of 384 typically represents a population of one
million or more (Krejcie and Morgan 1970). The main
dataset of 1201 Brahmins used in this study is about three
times larger and represents all different regions of India
(Table 1).
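
The figure of 384 can be reproduced with the sample-size formula published by Krejcie and Morgan (1970). The sketch below assumes their usual settings (a chi-square value of 3.841 for one degree of freedom at 95% confidence, a population proportion of 0.5, and a 5% margin); the function name is illustrative.

```python
def required_sample_size(population, chi_sq=3.841, p=0.5, margin=0.05):
    """Krejcie-Morgan sample size for a finite population.

    s = X^2 * N * P * (1 - P) / (d^2 * (N - 1) + X^2 * P * (1 - P))
    """
    numerator = chi_sq * population * p * (1 - p)
    denominator = margin ** 2 * (population - 1) + chi_sq * p * (1 - p)
    return round(numerator / denominator)


for n in (1_000, 10_000, 1_000_000):
    print(n, required_sample_size(n))
```

The required sample grows quickly for small populations and then flattens, which is the diminishing-rate behavior described above.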

The dataset represents a margin of error of 3.0% at a
confidence level of 95%. In other words, if a similar survey
of haplogroups is conducted one hundred times among
randomly selected Brahmins (i.e., 1201 × 100 = 120,100
Brahmins in total), the distribution in haplogroups is
expected to be about the same as in this study, with a
margin of error of plus or minus 3.0%. The data used in
the study are considered statistically significant, in terms
of both the sample size and geographic representation.
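
As a rough check on the stated margin of error, the standard normal-approximation formula for a proportion can be applied to the sample of 1201. The text does not say which formula was used, so the worst-case proportion of 0.5 and the z-value of 1.96 below are assumptions.

```python
import math


def margin_of_error(n, p=0.5, z=1.96):
    """Half-width of a 95% confidence interval for a proportion (normal approximation)."""
    return z * math.sqrt(p * (1 - p) / n)


moe = margin_of_error(1201)
print(f"{moe:.1%}")  # about 2.8%
```

Under these assumptions the margin comes out slightly below 3%, in line with the rounded figure quoted above.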


AMOVA (analysis of molecular variance)


In addition to identifying the predominant top-level haplogroups
with Whit Athey’s software, the datafile of 361
haplotypes was also used to corroborate a potential nexus
of Brahmins with thirty populations of Central Asia and
Europe in the world population database maintained at
YHRD.org (Willuweit and Roewer 2015).


Table 1 Geographic representation of Brahmins in the dataset used

Region/province          N   Percent
North
  Haryana                5      0.4
  Himachal Pradesh      80      6.7
  Jammu and Kashmir    173     14.4
  Punjab                88      7.3
  Sub-total            346     28.8
South
  Andhra Pradesh        12      1.0
  Karnataka             28      2.3
  Kerala                 8      0.7
  South India           37      3.1
  Tamil Nadu            71      5.9
  Sub-total            156     13.0
East
  Bihar                 38      3.2
  Jharkhand             17      1.4
  Bengal                46      3.8
  Sub-total            101      8.4
West
  Maharashtra           97      8.1
  Gujarat               65      5.4
  Rajasthan             58      4.8
  Sub-total            220     18.3
Other areas
  Madhya Pradesh        57      4.8
  Uttar Pradesh         48      4.0
  Other                273     22.7
  Sub-total            378     31.5
Total                 1201    100.0

The online statistical program AMOVA (analysis of 
molecular variance) at YHRD.org was employed to assess 
the genetic distance among Y-DNA haplotypes of Brah- 
mins and foreign populations (Excoffier et al. 1992). The 
Rst measure, which provides relatively unbiased estimates 
of population divergence, was used to determine the pro- 
portion of variance between haplotypes (Slatkin 1995). 
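
The quantity behind Rst can be illustrated with a single-locus sketch following Slatkin's definition, Rst = (S_total - S_within) / S_total, where each S is a mean squared difference in repeat count between pairs of chromosomes. This is not the YHRD implementation, which works on multi-locus haplotypes; the repeat counts below are invented toy data, equal sample sizes are assumed, and the function names are illustrative.

```python
from itertools import combinations


def mean_sq_diff(values):
    """Mean squared difference in repeat count over all distinct pairs."""
    pairs = list(combinations(values, 2))
    return sum((a - b) ** 2 for a, b in pairs) / len(pairs)


def rst_single_locus(pop_a, pop_b):
    """Single-locus Rst for two populations of equal sample size."""
    s_within = (mean_sq_diff(pop_a) + mean_sq_diff(pop_b)) / 2
    s_total = mean_sq_diff(pop_a + pop_b)
    return (s_total - s_within) / s_total


# Toy repeat counts at one Y-STR locus (hypothetical, not from the dataset).
similar = rst_single_locus([10, 11, 12, 11], [10, 11, 12, 11])
divergent = rst_single_locus([10, 10, 11, 10], [14, 15, 14, 14])
print(round(similar, 3), round(divergent, 3))
```

Values near zero (or slightly negative in small samples) indicate that almost all variance lies within populations, while values approaching one indicate strongly diverged populations.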


Results 
Genetic distance with foreign populations 


The results of the AMOVA test confirmed that the Brahmins
had genetic affinities with several foreign populations
and provided insights into their genetic makeup (Table 2).
The Rst values, a measure of genetic distance, ranged from a small
distance of 0.0181 with the Uighur population in Xinjiang,
China, to a much larger distance of 0.3387 in Finland. The
Rst values were below 0.05 for Uzbekistan, Iran, Hungary,
Turkey, Kazakhstan, and the Uighur group in Xinjiang,
China. The close genetic affinity with these geographic
areas is evident, because many migrations to North India
originated from or passed through these areas.


Table 2 Results of the AMOVA study with Rst distance and number
of haplotypes used in each foreign population

Population                       Rst distance   Haplotypes
Finland                             0.3387         1134
Kazakhstan                          0.1874          560
Saudi Arabia                        0.1640          597
Estonia                             0.1624          125
France                              0.1613          554
Jordan                              0.1546          164
Altai, Xinjiang, China (Tuva)       0.1402          150
Switzerland                         0.1323         1481
Mongolia (Mongolian)                0.1250          127
Sweden                              0.1223          702
China (Han)                         0.1150          631
Czech Republic                      0.1081         2033
Croatia                             0.0927         1339
Russian Federation (Russian)        0.0909          204
United Kingdom                      0.0728         3910
Italy (Italian)                     0.0727          152
Greece                              0.0717          782
Iraq                                0.0694         1609
Bosnia and Herzegovina              0.0673          100
Afghanistan (Afghan)                0.0652          169
Ukraine                             0.0590          217
Azerbaijan                          0.0564           47
Hungary (Hungarian)                 0.0548         1308
Slovenia                            0.0526          305
Uzbekistan                          0.0482          176
Iran (Iranian)                      0.0371          104
Hungary                             0.0369         1308
Turkey                              0.0363         1460
Kazakhstan                          0.0347          560
Xinjiang, China (Uighur)            0.0181          763
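
The selection of the closest populations can be expressed as a simple filter on the Rst values. The sketch below copies a subset of rows from Table 2 (both extremes and the rows around the 0.05 cut-off); the variable and function names are illustrative.

```python
# (population, Rst distance to Brahmins) for a subset of Table 2 rows
RST_TO_BRAHMINS = [
    ("Finland", 0.3387),
    ("France", 0.1613),
    ("United Kingdom", 0.0728),
    ("Ukraine", 0.0590),
    ("Slovenia", 0.0526),
    ("Uzbekistan", 0.0482),
    ("Iran (Iranian)", 0.0371),
    ("Hungary", 0.0369),
    ("Turkey", 0.0363),
    ("Kazakhstan", 0.0347),
    ("Xinjiang, China (Uighur)", 0.0181),
]


def closest_populations(rows, threshold=0.05):
    """Return rows with Rst below the threshold, closest population first."""
    return sorted((r for r in rows if r[1] < threshold), key=lambda r: r[1])


for name, rst in closest_populations(RST_TO_BRAHMINS):
    print(f"{rst:.4f}  {name}")
```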


Brahmin representation in Y-DNA haplogroups 


The analyses revealed that ancestors of Brahmins in the 
dataset belonged to fifteen different haplogroups. Twelve 
haplogroups were selected and three were discarded, 
because they contained insufficient data. The ancient 
geographic origins of the haplogroups were identified
from the Y-DNA Haplogroup Tree maintained at ISOGG 
(ISOGG 2016), and published sources. The 12 Y-DNA 
haplogroups—C, E, F, G, H, I, J, K, L, O, Q, and R— 
were used to identify 12 different genetic lineages and 
geographic origins of the Brahmins. The same 12 haplo- 
groups were used to compare the Brahmins with 50 other 
key non-Brahmin ethnic groups in the Indian subcontinent. 

The haplogroups of 1201 Brahmins and 2196 non-Brah- 
mins of the Indian subcontinent are displayed in Table 3. 
The results signify that because of admixture the Brahmins 
share the same underlying genetic heritage with several 
other ethnic communities in the Indian subcontinent, with 
the same ancestors and geographic origins. A recent study 
has shown that admixture between populations in India 
was ubiquitous from 1.9 to 4.2 kya, explaining
why haplogroups are shared by so many different ethnic 
groups (Moorjani et al. 2013). 

The geographic origins, key markers, and approximate 
age of all 12 haplogroups in the study are summarized in 
Table 4. A phylogenetic tree of the 12 haplogroups used in 
this study—with their key top-level markers starting from 
Y-Adam—appears in Fig. 1. 

About 83% of the Brahmins and about 77% of the
non-Brahmins in the study belonged to the same four
haplogroups R, J, H, and L, which are described below in
order of Brahmin representation (highest to lowest).
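
The two percentages follow directly from the haplogroup counts in Table 3. In the sketch below the counts are taken from the summary rows of that table, with the J and R counts being those implied by the row totals and the reported percentages; the dictionary and function names are illustrative.

```python
# Haplogroup counts from the summary rows of Table 3
BRAHMIN = {"C": 31, "E": 15, "F": 13, "G": 30, "H": 119, "I": 13,
           "J": 137, "K": 12, "L": 118, "O": 38, "Q": 47, "R": 628}
NON_BRAHMIN = {"C": 60, "E": 13, "F": 68, "G": 72, "H": 372, "I": 15,
               "J": 243, "K": 35, "L": 262, "O": 170, "Q": 62, "R": 824}
MAJOR = ("R", "J", "H", "L")


def share(counts, groups):
    """Fraction of all profiles that fall in the given haplogroups."""
    return sum(counts[g] for g in groups) / sum(counts.values())


print(f"Brahmins:     {share(BRAHMIN, MAJOR):.1%}")
print(f"Non-Brahmins: {share(NON_BRAHMIN, MAJOR):.1%}")
```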

Haplogroup R

Ancestral Markers on Phylogenetic Tree:
M168 → M89 → M9 → M45 → M207.

Representation in haplogroup: Brahmins 52.3%; non-Brahmins 37.5%.

This is the largest haplogroup in the Brahmin population
and it is also one of the most common haplogroups in
Europe. It originated as R-M207 in central Asia ~27 kya
(ISOGG 2016) and split into two subclades; a member carrying
the R1 mutation moved westward to the European
subcontinent, and a member with the R2 mutation moved
southward. R1 further bifurcated into subclades R1a and
R1b ~25 kya, with their mutations spreading to the
Middle East, Europe, the Russian steppes, and central and South
Asia. The origins of R1a are inferred to be from the Middle
East, possibly near modern-day Iran (Underhill et al. 2015).
Branches of haplogroup R, including R1a and R2a and their
subclades, started to arrive in India ~10 kya (Wells 2007),
close to the beginning of agriculture and IVC in North India.

In early 2009, a study attempted to prove that subclade
R1a1 reflected an indigenous origin of Brahmins in India
(Sharma et al. 2009). In late 2009, another study revealed
that the haplotypes of nearly all 26 ancient human specimens
from the Krasnoyarsk area in Siberia dated from the middle
of the second millennium BCE to the fourth century CE
belonged to subclade R1a1, which is thought to mark the
eastward migration of early Indo-Europeans (Keyser et al.
2009). Later, another study confirmed that subclade R1a1
appeared all across Eurasia (Pamjav et al. 2012). It appears
there were several migratory waves of people in haplogroup
R from the Middle East and central Asia into North India.


Table 3 Brahmin and non-Brahmin communities and their representation in 12 haplogroups

Brahmin communities      Total
Chitpavan                    7
Deshastha
Gour                         3
Iyengar                     42
Iyer                        29
Kanyakubja                  17
Kokanastha                  55
Namboodiri                   5
Sakaldwipi                  17
Saraswat                   319
Tamil                       22
Telugu                       5
Other Brahmins             677
[Check marks showing which haplogroups occur in each community are not recoverable here.]

Haplogroup                Total     C     E     F     G     H     I     J     K     L     O     Q     R
All Brahmins              1,201    31    15    13    30   119    13   137    12   118    38    47   628
All non-Brahmins          2,196    60    13    68    72   372    15   243    35   262   170    62   824
Total                     3,397    91    28    81   102   491    28   380    47   380   208   109 1,452
Percent all Brahmins       100%  2.6%  1.2%  1.1%  2.5%  9.9%  1.1% 11.4%  1.0%  9.8%  3.2%  3.9% 52.3%
Percent all non-Brahmins   100%  2.7%  0.6%  3.1%  3.3% 16.9%  0.7% 11.1%  1.6% 11.9%  7.7%  2.8% 37.5%
Percent total              100%  2.7%  0.8%  2.4%  3.0% 14.5%  0.8% 11.2%  1.4% 11.2%  6.1%  3.2% 42.7%

Twelve Y-DNA haplogroups of Brahmin (n = 1201) and non-Brahmin (n = 2196) communities
Sources: Sengupta et al. (2006), Zhao et al. (2009), Sharma et al. (2009), Yadav et al. (2010, 2011), Ghosh et al. (2011), Parvathy et al. (2011),
Genographic (2016), Mahal and Matsoukas (2018)


Table 4 Ancient geographic origins of 12 haplogroups of Brahmins, with markers and their estimated ages

Haplogroup  Marker  Age              Geographic origins
C           M130    ~50 kya          Southern Asia, part of first migration out of Africa, coastal India to Southeast Asia
E           M96     ~30 to 40 kya    Northeast Africa, part of second migration out of Africa, initially settled in Middle East
F           M89     ~45 kya          Northeastern Africa or the Middle East (in 90% of all non-African men), parent of haplogroups G to T
G           M201    ~10 to 23 kya    Eastern edge of the Middle East, close to Himalayan foothills, Indus Valley
H           M69     ~30 kya          South central Asia, known as the “Indian Marker”
I           M170    ~25 kya          Europe, Near East, Central Asia, known as the “European haplogroup”
J           M304    ~15 kya          Fertile Crescent (Mesopotamia, the land in and around the Tigris and Euphrates rivers)
K           M9      ~40 kya          Iran or south-central Asia, diverged from the M89 Middle Eastern clan
L           M11     ~25 to 30 kya    Pamir Knot region (Hindu Kush, Tian Shan, Himalayas) in Tajikistan
O           M175    ~35 kya          Central or East Asia, part of M9 Eurasian clan, early Siberians
Q           M242    ~15 to 20 kya    Siberia (North Asia), descendants were the first arrivals in North America
R           M207    ~4 to 27 kya     Central Asia (from the Caspian Sea to border of western China)

Sources: (Smolenyak 2004; Wells 2007; Y-DNA Haplogroup Tree, markers, and descriptions at ISOGG, https://isogg.org/tree/index.html)
kya = thousand years ago


Fig. 1 A phylogenetic tree of 12 Brahmin haplogroups and their markers identified in this study (top-level Y-DNA haplogroups shown in yellow, markers in blue)

Haplogroup J

Ancestral Markers on Phylogenetic Tree:
M168 → M89 → M304.

Representation in haplogroup: Brahmins 11.4%; non-Brahmins 11.1%.

The ancestor of this haplogroup was born in the Middle
East area known as the Fertile Crescent (Smolenyak and
Turner 2004), somewhere around the Zagros mountains
in modern south-western Iran, ~15 kya. Although some
early forms of cultivation may have started in North India, it
is postulated that this haplogroup brought additional methods
of agriculture and participated in laying the groundwork for the
IVC. It accounts for about 13% of all Indian Y chromosomes
(Kivisild et al. 2003).




Haplogroup H

Ancestral Markers on Phylogenetic Tree:
M168 → M89 → M69.

Representation in haplogroup: Brahmins 9.9%; non-Brahmins 16.9%.

The marker M69 of this haplogroup arose in western
India ~30 kya (Wells 2007). This haplogroup is often
referred to as the “Indian haplogroup.” It has a large representation
in the Indian subcontinent and suggests an indigenous
Indian origin because of its age (Cordaux et al. 2004).
This haplogroup represents the early hunter-gatherers of
North India who later participated in the development of
IVC.




Haplogroup L

Ancestral Markers on Phylogenetic Tree:
M168 → M89 → M9 → M20 → M11.

Representation in haplogroup: Brahmins 9.8%; non-Brahmins 11.9%.

The origins of this haplogroup can be traced to the rugged
and mountainous Pamir Knot region in Tajikistan ~30 kya
(Wells 2007). It evolved in India ~14 kya (Basu et al.
2003). Studies suggest that this may be one of the original
haplogroups of the creators of IVC (Sengupta et al. 2006).
It has a frequency of about 28% in western Pakistan and
Baluchistan, from where the early creators of IVC emerged
(Qamar et al. 2002). It is observed in lower frequencies in
Tajikistan, Uzbekistan and Russia (McElreavey and Quintana-Murci
2005).
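
The ancestral marker paths listed for the four major haplogroups can be compared to see where their lineages diverge on the phylogenetic tree. The sketch below encodes the paths exactly as given in this section; the dictionary and function names are illustrative.

```python
# Ancestral marker paths as listed for haplogroups R, J, H, and L
MARKER_PATHS = {
    "R": ["M168", "M89", "M9", "M45", "M207"],
    "J": ["M168", "M89", "M304"],
    "H": ["M168", "M89", "M69"],
    "L": ["M168", "M89", "M9", "M20", "M11"],
}


def last_shared_marker(hg_a, hg_b):
    """Return the most recent marker that two haplogroup paths have in common."""
    shared = None
    for marker_a, marker_b in zip(MARKER_PATHS[hg_a], MARKER_PATHS[hg_b]):
        if marker_a != marker_b:
            break
        shared = marker_a
    return shared


print(last_shared_marker("R", "L"))  # M9
print(last_shared_marker("J", "H"))  # M89
```

On these paths, R and L share ancestry down to M9, whereas J and H separate from the others immediately after M89.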


Discussion 
Potential errors in haplogroup predictions 


A study of 119 males in Argentina to determine Y-STR 
haplogroups with Whit Athey’s Haplogroup Predictor—the 
software used in this study—showed that the results were 
not totally accurate (Muzzio et al. 2011). Another study of 
165 males in Nicaragua with the same software produced 
accurate results for 95.2% of the sample, and 4.8% of the 
results were inaccurate (Nunez et al. 2012). 

Athey has explained that the main drawback in his soft- 
ware is the size of the database of some Y-STR haplotypes 
from which the allele frequencies are calculated. For most 
haplogroups the database contains sufficient data. However, 
for some haplogroups, namely, C, H, L, N, and Q, the data- 
base is smaller, and the results may be prone to error (Athey 
2006). 

Of the 1201 records in the Brahmin dataset, 361 or about
30% of the total were processed through Whit Athey’s software.
Of these, 323 haplotypes belonged to the error-prone
haplogroups C, H, L, and Q, identified by Athey. Assuming
an error rate of 5%, as reported in the Nicaraguan study
(Nunez et al. 2012), only eighteen haplotypes (5% of 361)
may have been assigned to the wrong haplogroup. That represents
a potential error rate of about 1.5% (18/1201) in the haplogroups
and geographic origins identified in this study.
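
The arithmetic behind this estimate is shown below. The 5% figure is the assumption stated in the text (rounded from the Nicaraguan study), and the variable names are illustrative.

```python
total_profiles = 1201       # all Brahmin profiles in the dataset
predicted_profiles = 361    # haplogroups assigned by prediction software
assumed_error_rate = 0.05   # assumption, rounded from Nunez et al. (2012)

misassigned = round(predicted_profiles * assumed_error_rate)
overall_error = misassigned / total_profiles

print(misassigned)              # 18
print(f"{overall_error:.1%}")   # 1.5%
```

The overall rate is low because the 840 profiles typed at source are assumed to carry no prediction error.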


Population movements in the Indian subcontinent 


It has been reported that most current ethnic groups in the 
Indian subcontinent descended from a mixture of two diver- 
gent populations. They are broadly defined as the Ancestral 
North Indians (ANI) who were genetically related to Cen- 
tral Asians, Middle Easterners, Caucasians, and Europeans, 
and the Ancestral South Indians (ASI) who were not closely 
related to any groups outside of the subcontinent (Reich 
et al. 2009). The ANI and ASI definitions do not relate to 
current geographic habitats. Although Brahmins descended 
from the ANI, they dwell in all different geographic parts of 
India, including the south, as shown in Table 1. Similarly, 
many ASI ethnic groups are now scattered all over India. 
The ANI were found to be a mixture of about 50% ances- 
try related to the Yamnaya and about 50% ancestry related 
to ancient Iranian farmers, whereas the ASI were a mixture 
of about 25% ancestry related to ancient Iranian farmers 
and about 75% ancestry related to previously established
local hunter-gatherers of South Asia (Reich 2019). The Iranian
farmer ancestry from the Fertile Crescent appears to
be reflected in haplogroup J. A recent study found significantly
elevated ratios of Yamnaya-related steppe ancestry in
two Brahmin communities in North India (Narasimhan et al.
2019), echoing their ancient roots.
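
The approximate ANI and ASI compositions quoted above imply the source ancestry of any group modeled as a mixture of the two. The sketch below is a simplification that treats a group as a plain two-way ANI/ASI mix; the `ani_fraction` value used in the example is hypothetical and not an estimate for any real community.

```python
# Approximate compositions as quoted in the text (Reich 2019)
ANI = {"yamnaya_steppe": 0.50, "iranian_farmer": 0.50, "south_asian_hg": 0.00}
ASI = {"yamnaya_steppe": 0.00, "iranian_farmer": 0.25, "south_asian_hg": 0.75}


def mixed_ancestry(ani_fraction):
    """Source ancestry of a group with the given ANI share (remainder is ASI)."""
    return {
        source: ani_fraction * ANI[source] + (1 - ani_fraction) * ASI[source]
        for source in ANI
    }


# Hypothetical group with equal ANI and ASI contributions
for source, value in mixed_ancestry(0.5).items():
    print(f"{source}: {value:.3f}")
```

Because both ANI and ASI carry Iranian farmer-related ancestry, that component is present at every mixing ratio, while the steppe component scales with the ANI share.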

The data indicate that most haplogroups in the Indian 
subcontinent advanced during the last 10,000 years. There 
were eight lineage expansions involving haplogroups R, H, 
and L between ~4.0 and ~7.3 kya, with the most prominent
occurring within subclade R1 between ~4.0 and 4.5 kya,
before the collapse of IVC (Poznik et al. 2016). 


Indo-European languages 


One of the early studies suggested that the Indo-European 
language family—which includes Sanskrit—was brought 
to Europe by Anatolian farmers at the onset of the Neo- 
lithic period (Renfrew 1992). Shortly thereafter, Maria 
Gimbutas proposed an opposing theory known as the Kur- 
gan hypothesis, or the Steppe theory, which identified the 
Russian steppes as the Indo-European urheimat (Gimbutas 
1993). Another study proposed that the early movements of 
admixed populations across the European steppe to the Altai 
mountain region involved speakers of the Indo-European 
languages (Gonzalez-Ruiz et al. 2012). 

Other recent studies have confirmed that the spread of 
Yamnaya people from the Pontic-Caspian steppe to northern 
Europe and central Asia was consistent with the hypothe- 
sized expansion of Indo-European languages (Allentoft et al. 
2015; Haak et al. 2015). According to Professor Majumder, 
extant populations inhabiting northern regions of India have 
shown affinities with Indo-European speaking populations 
of central Asia (Majumder 2010). It can be inferred from 
another study that members of haplogroups R, J, and L who 
migrated to India from central Asia and the Fertile Cres- 
cent, were associated with speakers of the Indo-European 
languages (Cordaux et al. 2004). 


Emergence of Indo-Aryans 


The nomadic people known as the Yamnaya occupied the 
steppes of southern Russia and eastern Ukraine between 
about 4500 and 2500 BCE. Their two branches known as 
the Sintashta (2200-1800 BCE) and Andronovo (2000-900 
BCE) are identified with the expansion of people known 
as Indo-Aryans in the steppes around present day Kazakh- 
stan, Uzbekistan, Tajikistan, and Turkmenistan (Anthony 
2007; Kuz’mina and Mallory 2007). The Sintashta are credited
with crafting the wheel and domesticating the horse
~7 kya (Curry 2019), which facilitated their expansion
to other parts of Europe, central Asia, and beyond.

Viktor Sarianidi, a Soviet archaeologist, discovered
the Bactria Margiana Archaeological Complex (BMAC),
which is located above the Amu Darya river in the Pamir
Mountains and represents a Bronze Age Oxus Civilization
of around 4000 BCE (Sarianidi 2007). Sarianidi found evidence
of sacred altars; traces of poppy seeds, cannabis, and
ephedra, ingredients for an intoxicating drink called soma 
(used in some ancient Vedic rituals); horse sacrifices; four 
wheeled chariots; and other connections with the Aryan 
teachings (Wood 2007). 

According to recent research, the steppe pastoralists (i.e., 
the descendants of Yamnaya) migrated toward south Asia, 
impacted BMAC but apparently bypassed it to move further 
toward India, later appearing as some outlier communities 
by the turn of the second millennium BCE (Narasimhan 
et al. 2019). An earlier study by Silva et al. indicated that 
the Andronovo and Sintashta people, also descendants of the 
Yamnaya, infiltrated and dominated the soma-using BMAC 
by ~3.5 kya, possibly earlier, and proposed that R1a was
a “highly plausible” subclade for the spread of Indo-Aryans 
into south Asia (Silva et al. 2017). 

This study concurs with Silva et al., because the results 
are widely supported: (a) the Indo-Aryans appear to have 
borrowed their distinctive religious beliefs and practices 
from the BMAC culture (Beckwith 2011), (b) two Indolo- 
gists have previously associated the Indo-Aryans with 
BMAC (Mallory 1989; Parpola 1999), (c) some BMAC 
materials have been found in IVC sites (Lamberg-Karlovsky 
2002; Anthony 2007), and (d) the soma-drinking practices 
of the Indo-Aryans are documented in early books of Indian 
history (Havell 1918; Smith 1921). 

Based on DNA analyses of one human skeleton from 
Rakhigarhi—an IVC site—and eleven outliers from about 
2800 to 2300 BCE, a recent study found little or no genetic
trace of Central Asian steppe pastoralists or Iranian
farmers/herders, suggesting these people were not abundant in
northwest India at the time (Shinde et al. 2019). However, 
the report also acknowledged that the sample size was under- 
powered for any broad conclusions of this type. By 2000 
BCE, the population of IVC was estimated to be about five 
million people (McEvedy and Jones 1978), suggesting the 
presence of several different genetic lineages, as elucidated 
in this study. 

Emerging primarily from haplogroup R from beyond 
the far side of Afghanistan, the Indo-Aryans—ancestors of 
Brahmins—started migrating south and arrived in North 
India during the second millennium BCE (Erdosy 1995; 
Joseph 2018), probably earlier. By then IVC had started to 
decline and de-urbanize, possibly due to environmental deg- 
radation and conflicts between the newcomers and the earlier 
settlers (Cavalli-Sforza and Cavalli-Sforza 1995). Renfrew’s 
elite dominance model explains the intrusion of a small 
but well-organized group which can dislodge an existing 
population and take over the region (Renfrew 1992), which
the Indo-Aryans apparently did. 

Evidently, in addition to holding their religious beliefs, the
new migrants were aggressive warriors. It is reported that
Alexander the Great’s encounter with Brahmin priests was not
peaceful. The Brahmins incited a holy war and all along his 
route Alexander met some of the fiercest resistance during 
his campaign. His army participated in several battles with 
the locals and one of these conflicts culminated in a lethal 
wound to Alexander that prompted his retreat from India 
(Fildes and Fletcher 2002). 


Conclusion 


Using an in silico analysis of the Y-DNA profiles of
1201 Brahmin men, this study concluded that this ethnic 
group did not originate from a single ancestral population, 
but instead emerged from at least 12 progenitors who, with 
one exception (haplogroup H), emerged from foreign lands 
that were geographically near or coterminous with India. 

There was a convergence of haplogroups in North India 
starting about 12 kya. Some members of haplogroups R,
H, and L were already inhabiting the area, mostly as hunt- 
ers and gatherers. Members of haplogroup J arrived from 
the west and brought their knowledge of agriculture to the 
region. These four haplogroups (R, J, H and L) represented 
about 83% of the Brahmins in this study. The populations 
admixed and became involved with the creation of the IVC in
North India. More members of other haplogroups arrived in
a series of waves, and newer members of haplogroup R (the
BMAC-related Indo-Aryan ancestors of Brahmins) brought
Sanskrit and tenets of the Vedic religion with them. After the
demise of IVC, starting around the second millennium BCE, 
some inhabitants, mostly the newcomers, remained in the 
north and others dispersed to the eastern and southern parts 
of India, resulting in the ANI and ASI population clusters. 
Over time, the Brahmins expanded and spread Hinduism 
throughout the Indian subcontinent. As a result of religious
conversions and admixture, at least 12 genetic lineages or 
Y-DNA haplogroups developed in this ethnic group. A larger 
data sample may uncover a few additional lineages. 

This study provided a contemporary assessment of 
the diverse ancient origins of this major ethnic group in 
the Indian subcontinent. It is expected that these findings 
will lead to additional insights about how the haplogroups 
expanded and evolved into various communities of Brah- 
mins and other ethnic groups in the Indian subcontinent. 